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CONFIDENCE  INTERVALS  AND  SAMPLE  SIZE 


1.  INTRODUCTION 

The  confidence  interval  is  a  useful  tool  in  the  analysis  and  presentation 
of  test  results.  It  is  an  estimate  of  a  range  of  values  that  is  stated  to 
contain  the  value  of  some  numerical  characteristic  of  a  statistical  population, 
the  interval  in  any  given  instance  being  computed  from  a  specific  sample  taken 
at  random  from  the  population.  In  some  cases  the  confidence  interval  is  specified 
and  sample  size  determined.  The  probability  that  the  statement  is  true  is  the 
"confidence  coefficient"  associated  with  the  given  confidence  interval.  This 
is  the  equivalent  to  saying  that  a  given  percent  confidence  interval  places  a 
certain  value  between  two  fixed  "confidence  limits". 

'  '/Many  of  the  misunderstandings  and  misuses  of  confidence  intervals  can 
be  avoided  if  the  following  properties  are  kept  in  mind:  (l)  the  interval  is  a 
statement  about  a  characteristic  of  a  statistical  population,  (2)  the  interval 
is  derived  from  a  random  sample  of  that  population  and  is  subject  to  sampling 
variation,  i.e.,  another  sample  and  a  repetition  of  the  test  would  be  expected 
to  yield  a  different  interval;  and  (3)  the  confidence  coefficient  is  not 
associated  with  a  particular  interval  but  with  the  particular  method  of  calculat¬ 
ing  it.  (  J  v. 

2.0  CONCEPTS 

Population  characteristics  such  as  the  mean  and  standard  deviation  can 
rarely  be  determined  exactly  because  of  the  unavailability  of  all  the  individuals 
in  the  population,  the  expense  of  examining  every  individual,  or  the  destructive 
nature  of  the  examination.  Consequently  population  characteristics  must  be  in¬ 
ferred  from  an  examination  of  a  small  part  of  the  population  -  a  randomly  selected 
sample.  The  general  character  of  the  population  often  is  known  or  can  safely  be 
assumed  to  be  of  a  type  that  can  be  described  by  a  known  mathematical  expression 
(distribution  function)  containing  one  or  more  unknown  constants  (parameters). 

When  that  is  the  case,  estimation  of  properties  of  the  population  is  reduced  to 
a  problem  of  estimation  of  the  unknown  parameters  of  the  distribution  functions. 

The  estimates  are  obtained  by  selecting  a  sample,  performing  the  experiment  or 
test,  making  the  required  observations  (e.g.,  measuring  a  dimension  or  counting 
the  number  of  individuals  in  one  of  two  categories),  and  making  the  calculations 
implied  by  an  appropriate  formula.  Thus  a  single-valued  estimate  (usually  called 
a  point  estimate)  of  the  parameter  is  obtained.  Experience  verifies  that  a 
repetition  of  those  steps  for  the  same  population  almost  always  yields  a  differ¬ 
ent  result  and  a  different  estimate  of  the  same  parameter.  Because  of  this 
statistical  variation  exhibited  by  single-valued  estimates,  an  interval  estimate 
that  could  take  into  account  that  variation  would  be  preferred  in  certain 
applications. 
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The  derivation  of  a  useful  interval  estimate  from  a  single  sample 
is  possible.  The  procedure  makes  use  of  probability  theory  which,  in  turn, 
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supposes  that  some  knowledge  of  the  distribution  of  the  population  is  available. 

For  example,  if  the  population  is  normally  distributed  and  a  random  sample  is 
available,  confidence  intervals  for  the  mean  and  for  the  standard  deviation  can 
be  calculated.  Although  the  confidence  interval  is  a  statement  that  the  parameter 
lies  within  the  interval,  it  is  not  an  absolute  inequality.  An  essential  part 
of  the  statement  is  the  confidence  coefficient.  (The  confidence  interval  is 
meaningless  without  the  confidence  coefficient.)  Since  the  interval  is  derived 
from  a  sample,  repeated  sampling  and  calculation  would  be  expected  to  yield  dif-  < 

ferent  intervals.  Some  of  them  will  bracket  the  parameter  and  some  will  not. 

The  confidence  coefficient  is  simply  the  proportion  of  such  intervals  -'assuming  1- 

the  entire  sampling,  calculating,  etc. ,  process  were  repeated  a  large  number  of 
times  -  that  may  be  expected  to  bracket  the  population  parameter.  A  particular 
interval,  however,  either  brackets  the  parameter  or  it  does  not.  The  confidence 
coefficient  is  descriptive  of  the  entire  process  rather  than  of  a  single  interval. 

Prior  to  the  conduct  of  sampling  and  experimentation  it  is  appropriate 
to  consider  the  probability  that  an  interval  calculated  from  the  outcome  of  the 
experiment  will  bracket  the  population  parameter.  That  probability  is  numerically 
equal  to  the  confidence  coefficient.  After  the  observations  are  obtained,  however, 
the  chance  elements  are  no  longer  present  and  the  concept  of  probability  is  no 
longer  applicable.  The  confidence  coefficient  associated  with  a  particular 
interval  must,  therefore,  not  be  interpreted  as  a  probability. 

2.1  STEPS 

Procedures  for  calculating  confidence  intervals  are  described  in  a 
through  d  below.  Paragraphs  3-1  through  3.1.6  give  step-by-step  examples  of 
procedures  for  determining  confidence  intervals  in  seven  common  situations.  In 
every  instance  it  is  assumed  that  the  observations  from  a  random  sample  are 
available.  The  number  of  observations  is  denoted  by  n,  the  observations  are 
called  xi  (the  subscript  i  may  be  omitted  when  no  ambiguity  results),  and  the 
confidence  coefficient  is  y . 


a.  The  first  step  is  to  decide  on  the  appropriate  type  of  confidence 
interval.  A  confidence  interval  may  be  "two  sided"  or  "one  sided".  A  two-sided 
interval  states  that  the  parameter  lies  between  two  specific  values  inside  the 
possible  range  of  the  parameter.  If  the  interval  states  merely  that  the  para¬ 
meter  is  greater  than  some  specific  value  with  no  statement  about  an  upper  limit, 
it  is  called  a  lower  confidence  limit.  This  would  be  a  one-sided  interval.  An 
upper  confidence  limit  is  similarly  defined.  The  graphical  aids  in  the  Appendix 
(with  one  exception)  are  made  specifically  for  two-sided  intervals.  They  can, 
however,  be  used  for  calculation  of  one-sided  intervals  provided  a  simple  adjust¬ 
ment  of  the  confidence  coefficient  is  made. 

b.  The  confidence  coefficient  is  selected.  Typical  coefficients  in 
common  use  are  0.90,  O.95,  and  O.99.  In  general,  the  larger  the  confidence  co¬ 
efficient,  the  wider  the  interval  will  be.  Since,  in  a  sense,  the  coefficient 
is  a  measure  of  the  "strength"  of  the  statement,  that  relationship  is  clearly 
consistent.  The  graphs  in  Figures  1  through  4  show  curves  for  confidence  co¬ 
efficients  of  0.80,  0.90,  0.95,  and  0.99  for  two-sided  intervals.  These  same 
curves  are  for  0.90,  0.95,  0.975,  and  0.995  (respectively)  for  one-sided  intervals. 
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c.  The  appropriate  table  is  entered  to  obtain  one  or  more  factors. 

Each  table  is  designed  for  ease  of  use.  Each  has  been  derived  from  an  accepted 
statistical  table,  such  as  the  Student-t  ,  Chi  Square,  standard  normal,  etc. 

d.  The  formulas  incorporating  the  observations  and  the  factors  from, 
c  are  calculated. 

2.2  USES 

Confidence  intervals  are  most  frequently  used  in  the  analysis  of  test 
results,  the  application  illustrated  in  this  pamphlet.  They  also  may  be  used  in 
the  planning  of  experiments,  as  an  aid  in  determining  the  number  of  samples 
needed  to  produce  a  desired  interval  with  a  specified  confidence  coefficient. 
Dependence  of  the  interval  on  the  test  results  greatly  limits  this  application. 

For  a  few  intervals,  however,  the  dependence  is  not  a  limitation.  For  example 
in  paragraph  3»1  the  width  (2KcrQ)  of  the  two-sided  interval  for  the  mean  does 
not  depend  on  the  observations.  If  an  interval  of  a  specified  width  is  desired, 
various  combinations  of  y  and  n  can  be  found  to  produce  that  width.  (The  limits 
of  the  graph  will  restrict  the  number  of  combinations.)  Similarly,  in  paragraph 
3.1.3  the  ratio  of  the  upper  to  the  lower  limits  of  the  two-sided  interval  for 
the  standard  deviation  is  independent  of  the  observations  and  depends  only  on  y 
and  n.  For  the  majority  of  experiments,  however,  the  planner  is  obliged  to  employ 
assumed  results  in  order  to  use  the  confidence  interval  as  an  aid  in  determining 
sample  size. 

'l 

As  a  tool  for  analyzing  data  generated  in  planned  experiments,  the 
confidence  interval  appears  simple  and  objective.  There  is  thus  a  growing 
tendency  to  rely  heavily  on  it  to  the  exclusion  of  other  valid  methods  of 
analysis.  A  simple  statistical  technique  can  rarely  take  into  account  all  the 
information  relevant  to  the  evaluation  of  the  performance  of  a  complex  item. 

In  the  testing  of  materiel  one  often  wishes  to  obtain  high  assurance  of  very 
low  (or  very  high)  incidence  of  bad  (or  good)  performance.  An  example  is  the 
desire  for  high  assurance  of  nearly  no  premature  functioning  of  high-explosive 
ammunition.  If  a  firing  test  is  made,  the  confidence  interval  for  the  expected 
premature  rate  derived  by  the  methods  of  paragraph  3.1*6  is  always  disturbingly 
large  even  when  no  premature  firing  occurs.  The  use  of  this  statistical 
technique  alone  will  not  provide  the  desired  result.  One  must  recognize  that 
such  tests  can  only  support  or  refute  an  expectation  of  low  incidence  of  premature 
firing  based  on  previous  experience  and  engineering  analyses.  In  essence,  one 
is  applying  an  "a  priori"  assumption  concerning  the  characteristics  of  the  test 
item  -  an  assumption  not  taken  inso  account  by  the  statistical  technique.  A 
discussion  of  this  limitation  in  the  use  of  statistical  methods  is  contained 
in  Reference  4-. 

3.0  EXAMPLES 

3.1  Mean  of  a  Normal  Population,  Standard  Deviation  Known 

From  prior  experiments  or  for  theoretical  reasons  the  standard  devia¬ 
tion  of  the  population  is  known.  An  experiment  is  conducted  to  determine  the 
mean  of  the  population. 
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p,  =  unknown  population  mean 
c0  =  known  standard  deviation 
n  =  number  of  observations 
Xj_  (i  =  1,  2,  n)  =  the  observations 

X  =  i  •£  Xj_,  the  sample  mean 

a.  The  type  of  confidence  interval  appropriate  for  the  desired  state¬ 
ment  is  determined: 

1)  A  two-sided  interval  if  it  is  desired  to  'place  p,  between  two 
finite  limits. 

2)  A  one-sided  interval  if  it  is  desired  to  place  a  lower  or 
upper  limit  on  p,. 

The  confidence  coefficient  applies  to  any  one  of  the  three  possible  intervals 
but  not  to  two  or  more  simultaneously. 

b.  The  desired  confidence  coefficient,  y,  is  selected.  Values  of  y 
available  from  the  graphs  are  0.80,  0.90,  0.95>  and  0*99  for  two-sided  inter¬ 
vals  and  0.90,  O.95,  0.975,  and  0.995  for  one-sided  intervals. 

c.  The  curves  of  Figure  1  are  used  to  obtain  a  factor  K  (ordinate) 

by  selecting  the  appropriate  curve  and  entering  the  graph  with  n  as  the  abscissa 
Curves  are  labeled  with  the  confidence  coefficients  for  two-sided  intervals.  For 
a  one-sided  interval  the  curve  is  selected  as  follows:  If  y'  is  the  desired  con 
fidence  coefficient,  use  the  curve  for  y  =  2y'  -  1. 

d.  Confidence  intervals  are  calculated  by  substitution  in  the  appro¬ 
priate  formula: 

X  -  Kao  £  p,  £  Y  +  Kctq  two- sided  interval 
X  -  Ka0  £  p,  one-sided  interval  (lower  limit) 

p,  £  T  +  Kc0  one-sided  interval  (upper  limit) 

Illustration:  Muzzle  velocities  of  8  rounds  of  projectile,  105-mm, 

HE,  Ml  selected  at  random  from  one  lot  of  ammunition  (Lot  A)  were  measured  to 
estimate  the  lot  average.  Records  of  firings  of  other  lots  from  this  production 
show  that  the  standard  deviation  of  velocity  is  4  fps.  A  two-sided  95  percent 
confidence  interval  for  the  mean  of  Lot  A  is  desired. 

p,  =  unknown  mean  of  Lot  A,  fps 

a0=  4  fps 

n  =  8 

-4- 


MTP  3-1-002 
25  January  1967 


The  observed  muzzle  velocities . are  as  follows: 


Round 

No. 


Muzzle  Velocity, 
fps 


X  =  12322  =  1514.9. 6 


Reference  to  the  curve  for  y  =  O.95  in  Figure  1  gives  K  =  0.69  when  n  =  8.  The 

confidence  interval  is  obtained  by  substitution  in  the  formula  in  d. 

15^9.6  -  O.69  (It)  s  ^  s  1549.6  +  O.69  (4) 

1546.8  S(i  s  1552.4 

It  is  asserted  with  95  percent  confidence  that  the  average  muzzle  velo¬ 
city  of  rounds  of  Lot  A  is  between  1546.8  and  1552.4  fps. 


3.1.1 


Mean  of  A  Normal  Population.  Standard  Deviation  Unknown 


The  experiment  is  conducted  to  determine  the  mean  of  the  population. 
Data  from  the  experiment'  are  used  to  derive  an  estimate  of  the  population  stan¬ 
dard  deviation,  i.e.,  no  prior  knowledge  of  the  standard  deviation  is  required. 

p  =  unknown  population  mean 

n  =  number  of  observations 

Xj_  (i  =  1,  2,  . .  n)  =  the  observations 

1  _ 

X  =  TT  2  Xj_ ,  the  sample  mean 

S  _  /  ~  ,  the  sample  standard  deviation 

v.f  1 — _ 

or  S  =  /nSX  -  (2  X)’  _  computing  formula 
V  n(n  -  l) 


a.  The  type  of  confidence  interval  appropriate  for  the  desired  state¬ 
ment,  is  determined: 


9 


ouna  s^a^ion  wnen  T/ne  crew  ae^errameo.  tne  crai^  w  i 
e-sided  (upper),  95  percent  confidence  interval  for 
sired.  No  prior  knowledge  of  the  standard  deviatloi 

p,  =  unknown  population  mean  of  hor: 
between  aircraft  and  ground  sti 

n  =  6 


The  observations  are  as  follows: 


Run  No. 


1 

2 

3 

5 

6 


Horizo: 

Distan 


50 

73 

^3 

5^ 

6o 

37 
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X  =  511  =  52.8 

Ss=  n  S  X2  -  (S  X)3  =  6(17563)  -  (317)2  »  l62>97 

n(n  -  l)  30 

S  =  12.77 


The  appropriate  curve  in  Figure  2  is  selected.  Since  a  one-sided 
interval  with  95  percent  confidence  is  required,  the  curve  is  taken  as 
2(0.95)  -  1  =  0.90,  i.e.,  the  curve  y  =  0.90  is  used.  For  n  =  6,  K  =  0.82. 

The  confidence  interval  for  p  is 

p  £  T  +  KS 

p  <;  52.8  +  0.82  (12.77) 

n  *  63.3 


It  is  asserted  with  95  percent  confidence  that  the  average  horizontal 
distance  between  aircraft  and  ground  station,  when  the  aircraft  crew  determines 
that  the  craft  is  over  the  station,  is  no  greater  than  63.3  meters. 

3.1.2  The  Standard  Deviation  Of  A  Normal  Population,  Mean  Known 

From  prior  experiments  or  for  theoretical  reasons  the  mean  of  the 
population  is  known.  An  experiment  is  conducted  to  estimate  the  standard  devia^ 
tion  of  the  population. 


Xj.  (i  = 


pQ  =  known  population  mean 
a  -  unknown  standard  deviation 
n  =  number  of  observations 
1,  2,  ....,  n)  =  the  observations 


7_ 


2(Xi  -  p 0Y‘ 


,  the  sample  standard  deviation 


do  >  computing  formula 


a.  The  type  of  confidence  interval  appropriate  for  the  desired  state 
ment  is  determined: 


l)  A  two-sided  interval  if  it  is  desired  to  place  a  between  two 
positive,  finite  limits. 


•a."* 
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2)  A  one-sided  interval  if  it  is  desired  to  place  a  lower  positive 
limit  on  a,  or  to  place  an  upper  limit  on  c. 

The  confidence  coefficient  applies  to  any  one  of  the  three  possible  intervals  but 
not  to  two  or  more  simultaneously. 

b.  The  desired  confidence  coefficient,  y,  is  selected.  Values  of  y 
available  from  the  graphs  are  0.80,  0.90,  0.95,  and  0.99  for  two-sided  intervals 
and  0.90,  0.95,  0.975,  and  0.995  Tor  one-sided  intervals. 

c.  Figure  3  contains  a  pair  of  curves  for  each  value  of  y  .  If  a  two- 
sided  interval  is  desired,  a  value  Kx  is  read  from  the  lower  curve  of  the  pair 
and  Kg  is  read  from  the  upper  curve.  If  a  one-sided  interval  is  desired,  the 
value  is  for  the  lower  limit  and  Kg  is  for  use  in  determining  an  upper  limit. 

The  abscissa  is  given  as  "degrees  of  freedom  for  S. "  When  the  formula  for  S 
given  in  this  section  is  used,  degrees  of  freedom  for  S  is  equal  to  n.  The  values 
of  v  are  for  two-sided  intervals.  For  a  one-sided  interval  the  curve  is  selected 
as  follows:  If  y'  is  the  desired  confidence  coefficient,  use  the  curve  for 

y  =  2y'  -  1. 

d.  Confidence  intervals  are  calculated  by  substitution  in  the  appropriate 

formula: 


3-1.3. 


iq  S  s  <j  £  l^S  two-sided  interval 
Kj S  £  ct  one-sided  interval  (lower  limit) 

o  £  KgS  one-sided  interval  (upper  limit) 

The  Standard  Deviation  Of  A  Normal  Population.  Mean  Unknown 


An  experiment  is  conducted  to  estimate  the  standard,  deviation  of  the 
population.  The  mean  is  unknown  and  will  be  estimated  from  the  data. 


y,  =  unknown  population  mean 
a  -  unknown  population  standard  deviation 


n  =  number  of  observations 


(i  =  1,  2,  n)  =  the  observations 

"X  =  ^  E  X^,  the  sample  mean 


s  .  I 

V  n  -  1 


or  S 


/  nE  Xi3-  (E 
V  n(n  -  l) 


,  the  sample  standard  deviation 

XJ3 

— —  ,  computing  formula 


a.  The  type  of  confidence  interval  appropriate  for  the  desired  state¬ 
ment  is  determined: 


Q 
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1)  A  two-sided  interval  if  it  is  desired  to  place  a  between  two 
positive,  finite  limits. 

2)  A  one-sided  interval  if  it  is  desired  to  place  a  lower  positive 
limit  on  a,  or  to  place  an  upper  limit  on  <3 

The  confidence  coefficient  applies  to  any  one  of  the  three  possible  intervals 
but  not  to  two  or  more  simultaneously. 

b.  The  desired  confidence  coefficient,  y,  is  selected.  Values  of  y 
available  from  the  graphs  are  0.80,  0.90,  O.95,  and  0.99  for  two-sided  inter¬ 
vals  and  0.90,  0.95,  0.975 ,  and  0.995  for  one-sided  intervals. 

c.  Figure  3  contains  a  pair  of  curves  for  each  value  of  y.  If  a  two- 
sided  interval  is  desired,  a  value  is  read  from  the  lower  curve  of  the  pair 
and  K,  is  read  from  the  upper  curve.  If  a  one-sided  interval  is  desired,  the 
value  Kj  is  for  the  lower  limit  and  K»,  is  for  use  in  determining  an  upper  limit. 
The  abscissa  is  given  as  "degrees  of  freedom  for  S."  When  the  formula  for  S  given 
in  this  section  is  used,  degrees  of  freedom  for  S  is  equal  to  n  -  1.  (Note:  The 
formula  in  this  section  differs  from  the  formula  for  S  given  in  3*4.  When  the 
population  mean  is  unknown  and  estimated  from  the  observations,  the  number  of 
linearly  independent  variables  in  S  is  reduced  by  one.  Hence  degrees  of  freedom 
is  one  less  than  the  sample  size.)  The  values  of  y  are  for  two-sided  intervals. 
For  a  one-sided  interval  the  curve  is  selected  as  follows:  If  y'  is  the  desired 
confidence  coefficient,  use  the  curve  for  y  =  2y'  -  1. 

d.  Confidence  intervals  are  calculated  by  substitution  in  the  appro¬ 
priate  formula: 

Kl  S  so  <;  KaS  two-sided  interval 
Kx S  £  a  one-sided  interval  (lower  limit) 

a  i  KjS  one-sided  interval  (upper  limit) 

Illustration:  The  ability  of  a  military  vehicle  to  travel  through  soft 
soil  may  be  judged  by  driving  the  vehicle  along  an  appropriate  course.  The 
strength  of  the  soil  of  the  course  may  be  inferred  from  cone  penetrometer  tests 
made  immediately  prior  to  the  vehicle  operation.  The  cone  penetrometer  result 
is  expressed  as  a  cone  index.  Uniformity  of  soil  conditions  over  the  test  site 
is  desirable.  An  estimate  of  variation  in  the  course  is  to  be  inferred  from  the 
variation  in  cone  index  from  location  to  location  along  the  course.  Measure¬ 
ments  were  made  at  15  locations.  From  these  a  two-sided  confidence  interval  for 
the  standard  deviation  of  cone  index  would  be  made.  A  90  percent  level  was 
chosen.  The  cone  indexes  are:  25,  40,  4-2,  34,  33,  39,  32,  4l,  34,  44,  31,  24, 

48,  23  and  33. 

a  =  unknown  population  parameter,  standard  deviation  of  cone 
index 
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/  nZ  X1 2  -  (E  X)£ 
V  n(n-  l) 


15(19011)  -  (523)‘ 
15(14) 


=  7.4 


Referring  to  Figure  3>  Y  =  0.90  gives  =0.77  and  =  1.46  for 

degrees  of  freedom  n  -  1  =  l4. 

KiSiasKjS 

0.77(  7.4)  s  a  s  1.46(7.4) 

5.7  s  a  s  10.8 


3.1.4  Mfference  Between  Means  of  Two  Normal  Populations.  Standard  Deviations 
Known.  Sample  Sizes  Equal 

An  experiment  is  conducted  to  estimate  the  difference  between  the  means 
of  two  normal  populations,  population  A  and  population  B.  From  prior  experiments 
or  for  theoretical  reasons  the  standard  deviations  of  the  populations  are  known. 
The  sample  sizes  are  equal,  i.e.,  the  number  of  observations  from  population  A 
is  the  same  as  the  number  from  population  B.  (This  restriction  on  sample  size 
is  made  in  order  to  employ  a  simple  graphical  aid. ) 


xi  (i 
Yi  (i 


Mi  and  ^ 

ax  and  aa 
n 


1,  2,  . . . . , 

1,  2,  . . . . , 


=  unknown  means  of  population  A  and  B,  respectively 
=  known  standard  deviations 

=  number  of  observations  from  each  population 
n)  =  observations  from  A 
n)  =  observations  from  B 


_  1 


X  -  n  ^ 


Y  =  i  Hi 


sample  means 


n 


a.  The  type  of  confidence  interval  appropriate  for  the  intended  state¬ 
ment  is  selected. 


1)  A  two-sided  interval  is  the  only  appropriate  interval  if  the 

sign  of  the  difference  ^  ^  is  of  no  interest. 

2)  Either  the  two-sided  or  a  one-sided  interval  may  be  appropriate 
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if  the  difference  of  interest  was  selected  before  the  experiment 
was  run  or  the  sign  of  the  difference  is  a  matter  of  consideration 
3)  The  two-sided  interval  places  mi  “  Me  between  two  finite  limits, 
whereas  a  one-sided  interval  places  either  a  lower  or  upper 
limit  on  mi  -  Me  • 

b.  The  desired  confidence  coefficient,  y,  is  selected.  Values  of  y 
available  from  the  graphs  are  0.80,  0.90,  O.95,  and  O.99  for  two-sided  intervals 
and  0.90,  0.95,  0.975,  and  0.995  for  one-sided  intervals. 

c.  The  curves  of  Figure  1  are  used  to  obtain  a  factor  K  (ordinate)  by 
selecting  the  appropriate  curve  and  entering  the  graph  with  n  as  the  abscissa. 
Curves  are  labeled  with  the  confidence  coefficients  for  two-sided  intervals. . 

For  a  one-sided  interval  the  curve  is  selected  as  follows!  If  y  is  the  desired 
confidence  coefficient,  use  the  curve  for  y  =  2y'  -  1. 

d.  Confidence  intervals  are  calculated  by  substitution  in  the  appro¬ 
priate  formula: 

Two-sided  interval  (3T-*Y)  -  K 

One-sided  interval  (r-T)--  k 
(lower  limit) 

One-side^  interval 
(upper  limit) 

Illustration:  Armor  plate  of  two  different  aluminum  alloys  was  tested 
to  determine  the  difference  between  the  alloys  in  resistance  to  penetration  by 
cal  .30  armor-piercing  projectiles.  Three  sample  plates  of  each  alloy,  all 
having  the  same  areal  density,  were  tested  to  find  V-50  ballistic  limits.  Inform¬ 
ation  from  prior  testing  of  similar  material  indicates  that  the  standard  de¬ 
viation  of  ballistic  limits  would  be  30  fps  for  either  alloy  when  tested  with 
this  projectile.  It  is  desired  to  estimate  the  difference  between  mean  ballistic 
limits  of  the  two  alloys  with  95  percent  confidence  limits.  A  two-sided  interval 
is  considered  appropriate  for  the  purpose  of  the  test. 


Mj  =  unknown  mean  ballistic  limit  of  Alloy  A 
Mg  =  unknown  mean  ballistic  limit  of  Alloy  B 
°i  =  CTs  =  30  fps 

The  test  ballistic  limits,  in  feet  per  second,  are  as  follows: 


Alloy  A 
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Y  =  4770/3  =  1590  (Alloy  A) 

T  =  4572/3  =  1524  (Alloy  fl) 

In  Figure  1  the  curve  for  y  =  0.95  is  selected.  The  value  K  =  1.13  is 
obtained  for  n  =  3.  The  two-sided  interval  is 

(3T  -  Y)  -  K  Jo,3  +  o3  *  (X  -  Y)  +  K  Jo,3  +  ass 

(1590  -  1525)  -  1.13  V900  +  900  £  1 y*  ~  Ps  £  (1590-1524)+1.13  V 900+900 

66  -  1.13(42.4)  <;  1^-  Jia  s  66  +  1.13(42.4) 

18  s  ^  -  na  s  114 


It  is  asserted  with  95  percent  confidence  that  the  difference  between 
population  means  of  ballistic  limits  of  these  two  aluminum  alloys  when  tested  with 
cal  .30  armor-piercing  projectiles  is  between  18  and  ll4  fps. 

3.1.5  Difference  Between  Means  of  Two  Normal  Populations.  Standard  Deviations 
Unknown  But  Equal.  Sample  Sizes  Equal 

An  experiment  is  conducted  to  estimate  the  difference  between  the  means 
of  two  normal  populations,  populations  A  and  B.  Standard  deviations  are  unknown 
but  can  be  assumed  to  be  the  same  for  both  populations.  The  sample  sizes  are 
equal,  i.e.,  the  number  of  observations  from  population  A  is  the  same  as  the 
number  from  population  B.  (This  restriction  on  sample  size  is  made  in  order  to 
employ  a  simple  graphical  aid. ) 

•  and  Hg  =  unknown  means  of  populations  A  and  B,  respectively 
n  =  number  of  observations  from  each  population 
X±  (i  =  1,  2,  . . . . ,  n) 

(i  =  1,  2,  ....,  n) 

~X 
Y 


=  observations  from  A 


-  observations  from  B 


“  n  Hi 


=  -  E  Yi 
n  x 


sample  means 


Si2  = 


E(Xi  -~T}3 


sample  variance  from  A 
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n  Z  Xjs  -  (atj)2  ^  computing  formula 
n(n  -  l) 

E(Yi  -  Y)2 

_  ,  sample  variance  from  B 

n  -  1 


L 

a.  The  type  of  confidence  interval  appropriate  for  the  intended  state¬ 
ment  is  selected. 

1)  A  two-sided  interval  is  the  only  appropriate  interval  if  the 
sign  of  the  difference  ^ is  of  no  interest. 

2)  Either  the  two-sided  or  a  one-sided  interval  may  be  appropriate 
if  the  difference  of  interest  was  selected  before  the  experiment 
was  run  or  the  sign  of  the  difference  is  a  matter  of  consideration. 

3)  The  two-sided  interval  places  yi  -  Hg  between  two  finite  limits, 
whereas  a  one-sided  interval  places  a  lower  or  upper  limit  on 

b.  The  desired  confidence  coefficient,  y,  is  selected.  Values  of  y 
available  from  the  graphs  are  0.80,  0.90,  0.95>  and  0.99  for  two-sided  intervals 
and  0.90,  0.95,  0.975,  and  O.995  for  one-sided  intervals. 

c.  The  curves  of  Figure  4  are  used  to  obtain  a  factor  K  (ordinate)  by 
selecting  the  appropriate  curve  and  entering  the  graph  with  n  as  the  abscissa. 

Curves  are  labeled  with  the  confidence  coefficients  for  two-sided  intervals.  For 
a  one-sided  interval  the  curve  is  selected  as  follows:  If  y'  is  the  desired  con¬ 
fidence  coefficient,  use  the  curve  for  y  =  2y'  -  1. 

d.  Confidence  intervals  are  calculated  by  substitution  in  the  appro¬ 
priate  formula: 

(JC  -  Y)  -  KSp  £  14  -  Hg  £  (Y  -  Y)  +  KSp  two-sided  interval 

(Y  -  Y)  -  KSp  ^  Pq.  -  Ug  one-sided  interval  (lower  limit) 

yj  -  (Xg  5  (X  -  Y)  +  KSp  one-sided  interval  (upper  limit) 

Illustration:  Preliminary  information  indicated  that  the  addition  of 
a  certain  wear  reducing  agent  to  the  standard  propelling  charge  for  175-nun 
ammunition  produced  an  increase  in  initial  blast  overpressure.  An  estimate 
of  the  increase  was  desired.  Eight  rounds  with  the  standard  charge  and  eight 
with  the  agent  added  to  the  standard  charge  were  fired.  Blast  overpressure 
occuring  at  a  selected  position  near  the  weapon  was  recorded  for  each  round. 

(For  simplicity  of  the  illustration  only  one  location  is  considered.)  It  is 
desired  to  place  95  percent  confidence  limits  on  the  difference  between  means  of 
blast  overpressure.  The  standard  deviations  are  unknown  but  believed  to  be  equal. 


-13- 
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The  blast  overpressures  (psi)  are  as  follows: 


Standard  Charge  With  Additive 


Standard  Charge 


2.26 

1.17 

2.16 

1.05 

2.07 

1.23 

2.11 

1.03 

1.98 

1.27 

2.36 

1.21 

2.16 

1.31 

2.19 

1.12 

^  =  unknown  mean  overpressure,  charge  with  additive 
=  unknown  mean  overpressure,  standard  charge 
n  =  8 

X  =  IJu-ffi.  =  2.16  (with  additive) 

8 

Y  =  9«3 9  =  1.17  (standard) 

8 

o  8(37.4619)  -  (17.29)2  _  0  Q1?u 

*  ~ - rff) -  '  • 

„  a.  8(11.0927)  •  (9.39)a  ,  0.0102 

^  8(7) 


S  =  /0.0134  +  0.0102  =  0.11 

p  N  2 

Reference  to  Figure  4  for  n  =  8  and  y  =  0.95  gives  K  =  1.07.  The  two- 
sided  confidence  interval  is 

(2.16  -  1.17)  -  1.07(0.11)  £  ^  *  (2.16  -  1.17)  +  1.07(0.11) 

O.87  ^  (Jii  -  (ip  ^1.11 

It  is  asserted  with  95  percent  confidence  that  the  average  increase  in  blast  over¬ 
pressure,  at  the  specific  location,  resulting  from  the  use  of  this  wear-reducing 
agent  is  between  O.87  and  1.11  psi. 

3.1.6  Binomial  P 

When  the  result  of  a  single  trial  in  an  experiment  is  one  of  two  mutually 
exclusive  categories  such  as  "success"  or  "failure",  and  the  probability  of  failure 
in  a  single  trial  is  constant  from  trial  to  trial,  the  binomial  distribution  is 
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applicable.  This  distribution  contains  one  parameter,  P  (0  i  P  *  l),  which  is 
the  probability  of  failure  in  a  single  trial.  If  n  is  the  number  of  trials  con¬ 
ducted,  the  probabilities  of  the  number  of  failures  (P,  1,  2,  . ..,  n)  can  be  cal¬ 
culated  in  terms  of  n  and  P.  (in  this  discussion  P  is  probability  of  failure, 
but  it  could  be  probability  of  success  by  redefinition  of  categories). 

Since  P  is  a  population  parameter  it  is  usually  unknown.  Interval  esti¬ 
mates  of  P  can  be  obtained  by  observing  the  number  of  successes  in  a  sample  of  n 
trials.  The  calculations,  however,  are  not  simple  for  the  general  case.  There¬ 
fore  only  one  special  situation  is  treated  in  this  pamphlet.  This  is  the  case 
in  which  P  is  expected  to  be  close  to  zero.  The  extremes  of  the  possible  range 
of  P  are  of  most  frequent  interest  in  testing  materiel.  Often  P  represents  the 
probability  of  failure  of  an  item  when  it  is  employed  in  a  specified  environment. 

An  experiment  consisting  of  several  trials  is  performed  in  order  to  find 
an  upper  confidence  limit  on  P. 

n  =  number  of  trials 

K  =  number  of  failures  observed 

P  =  unknown  population  parameter,  probability  of  failure 
in  a  single  trial 

Pu  -  upper  confidence  limit  for  P 

a.  The  figure  corresponding  to  K  (0,  1,  2,  3,  or  4)  is  selected  from 
among  Figures  5  through  9. 

b.  The  desired  confidence  coefficient  is  selected:  y  =  0.90,  0.9?,  or 

0 • 99 •  /  - ,  ,  \ 

c.  The  graph  is  entered  with  n  as  abscissa  and  the  value  Pu  \ ordinate; 
is  read  from  the  curve  for  the  selected  value  of  y. 

d.  The  statement  is  made  that  P  SPU  with  100  y  percent  confidence. 

Illustration:  A  sample  of  48  projectiles,  HE,  M437  selected  from  one 
production  lot  were  subjected  to  a  standard  laboratory  vibration  schedule  to  deter 
mine  what  proportion  of  that  lot  could  be  expected  to  suffer  damage  in  that 
environment.  (Production  is  assumed  to  be  large  relative  to  the  sample  size  of 
48.)  After  the  vibration  test  was  completed,  inspection  of  the  projectiles  re¬ 
vealed  that  none  of  the  48  was  damaged. 

P  =  unknown  binomial  parameter  representing  probability  of 
an  individual  projectile's  being  damaged 

n  =48 

K  =  0 

A  95  percent  upper  confidence  limit  on  P  is  desired.  The  appropriate 
graph  for  K  =  0  and  y  =  0.95  is  found  in  Figure  5-  The  graph  is  entered  with 
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n  =  48  as  the  abscissa,  and  a  value  Pu  (ordinate)  is  read  from  the  curve  for 
Y  =  0.95  as  0.06(j..  .The  interval  is  P  £  0.060.  It  is  asserted  with  95  percent 
confidence  that  the  probability  of  a  projectile  of  this  lot's  being  damaged  in 
this  vibrational  environment  is  no  greater  than  0.060. 


m*..*-V*  .*.>  • „  t _ V.- J  *•!*..  ■* !>_"**■  I *^i.i*^* 


L-Xv.rvv 
I  vVW 
hi  . 
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APPENDIX  A 


GRAPHS  FOR  CALCULATING  CONFIDENCE  INTERVALS 
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Figure  1.  Factors  K,  for  Estimating  the  Mean,  p,,  of  a  Normal 

Population  -  Population  Standard  Deviation,  a,  Known. 
Confidence  Coefficients  (Two-Sided  Interval)  y  =  0.80, 
0.90,  O.95,  and  0.99. 
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X  -  Y  -  KSp  <  iUj  -  Hz  <  X  -  Y  +  KSp 


2  3  4  5  10  20  30  40  50  100 

Sample  Size,  n 


Figure  4.  Factors,  K,  for  Estimating  the  Difference,  ^ 

Between  Means  of  Two  Normal  Populations  -  Standard 
Deviations  Unknown  But  Equal;  Equal  Sample  Sizes. 
Confidence  Coefficients  (Two-Sided  Interval) 
v  =  0.80,  0.90,  and  O.99. 
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Figure  5.  Upper  Confidence  Limit,  Pu,  of  Binomial  P  (P  = 
Probability  of  Failure  in  One  Trial)  When  No 
Failure  is  Observed  in  n  Trials.  Confidence  Coefficients 
y  =  0.90,  0.95;  and  0.99* 
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Figure  6.  Upper  Confidence  Limit,  of  Binomial  P  (P  = 
Probability  of  Failure  in  One  Trial)  When  One 
Failure  is  observed  in  n  Trials.  Confidence 
Coefficients  y  =  0.90,  0.95;  and  0.99* 
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Figure  8.  Upper  Confidence  Limit,  Pu,  of  Binomial  P  (P  = 
Probability  of  Failure  in  One  Trial)  When  Three 
Failures  are  Observed  in  n  Triads.  Confidence 
Coefficients  y  =  0.90,  0.95,  and  0.99* 


